Highband spectrum envelope estimation of telephone speech using hard/soft-classification

Authors

  • Yasheng Qian
  • Peter Kabal
Abstract

Telephone bandwidth is generally defined as 300–3400 Hz. This restriction has a noticeable effect on speech quality. We present an algorithm that recovers the missing highband components from telephone speech. We describe an MMSE estimator that uses hard/soft classification to create the missing highband spectrum envelope. The classification is motivated by acoustic phonetics: voiced vowels and consonants, and unvoiced phonemes exhibit different characteristic spectra. The classification also captures gender differences. Hard classification on phoneme-characteristic parameters, such as the degree of voicing and the pitch lag, reduces the mean-square error of the highband spectrum envelope estimates. An estimator using HMM-based soft classification further reduces the highband spectrum distortion by taking the time evolution of the spectra into account. Objective measures (mean log-spectrum distortion) and spectrograms confirm the improvement noted in informal subjective tests.
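The difference between hard and soft classification can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each class k already has a trained conditional-mean highband envelope (the hypothetical array `class_means`) and that per-frame class likelihoods are available. The function names, the two-state example, and the use of a plain forward recursion for the HMM posteriors are all assumptions made for illustration.

```python
import numpy as np

def hard_mmse_estimate(class_means, posteriors):
    """Hard classification: pick the most probable class and return
    its conditional-mean highband envelope."""
    return class_means[np.argmax(posteriors)]

def soft_mmse_estimate(class_means, posteriors):
    """Soft classification: weight every class mean by its posterior,
    i.e. E[y | x] = sum_k P(k | x) * E[y | k]."""
    return posteriors @ class_means

def forward_posteriors(likelihoods, transition, initial):
    """Normalized HMM forward recursion (illustrative assumption).

    likelihoods: (T, K) array of p(x_t | state k)
    transition:  (K, K) array, transition[i, j] = P(state j | state i)
    initial:     (K,) array of initial state probabilities
    Returns a (T, K) array of P(state_t = k | x_1..x_t).
    """
    T, K = likelihoods.shape
    alpha = np.empty((T, K))
    a = initial * likelihoods[0]
    alpha[0] = a / a.sum()
    for t in range(1, T):
        # Predict from the previous frame, then weight by the new evidence.
        a = (alpha[t - 1] @ transition) * likelihoods[t]
        alpha[t] = a / a.sum()
    return alpha

# Toy example: two classes, two-dimensional envelope parameters.
class_means = np.array([[0.0, 0.0], [2.0, 4.0]])
posteriors = np.array([0.25, 0.75])
hard = hard_mmse_estimate(class_means, posteriors)
soft = soft_mmse_estimate(class_means, posteriors)
```

In this sketch the hard estimate jumps between class means from frame to frame, while the soft estimate blends them according to posteriors that depend on previous frames through the transition matrix, which is one way to take the time evolution of the spectra into account.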

Similar resources

Wideband Speech Recovery from Narrowband Speech Using Classified Codebook Mapping

Speech sounds occupy 8 kHz or more of bandwidth. However, current public telephone networks limit the speech bandwidth to 300–3400 Hz. Telephone speech is characterized by thin and muffled sounds, and degraded speaker identification. We describe an algorithm which generates the missing highband components from the narrowband speech signal. The algorithm is based on three acoustic-phonetic class...

Pseudo-wideband Speech Reconstruction from Telephone Speech

The bandwidth of telephone speech is limited to 300–3400 Hz. The sound quality is much lower than that of broadcast radio and audio compact discs. We present an algorithm to regenerate the missing highband components (3.4–7 kHz). The highband spectrum recovery is based on a Line Spectrum Frequency (LSF) VQ codebook mapping from the narrowband speech to the high-frequency components. T...

Dual-mode wideband speech recovery from narrowband speech

The present public telephone networks trim off the lowband (50–300 Hz) and the highband (3400–7000 Hz) components of sounds. As a result, telephone speech is characterized by thin and muffled sounds, and degraded speaker identification. The lowband components are deterministically recoverable, while the missing highband can be recovered statistically. We develop an equalizer to restore the lowb...

Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model

We present an algorithm to derive 7 kHz wideband speech from narrowband “telephone speech”. A statistical approach is used that is based on a Hidden Markov Model (HMM) of the speech production process. A new method for the estimation of the wideband spectral envelope is proposed, using nonlinear state-specific techniques to minimize a mean square error criterion. In contrast to common m...

The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech

The quality of narrowband telephone speech can be improved by artificial bandwidth expansion (ABE), which generates missing frequency components above the telephone bandwidth using only information from the narrowband speech signal. Straightforward bandwidth expansion methods do not reproduce the harmonic structure of voiced sounds properly, but a pitch-adaptive technique can be used to approxi...


Publication date: 2004